1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 



„.CTTCT.T TX.CTTACTC CCT.T^^^^^^^^^^ OGCCC-^^^^^ 
CCGCCCAAGC CCCGCGTAAA CCTGGGTGAC ^TC^ ^^cCCGGCG 
CATGAGTTCC CGACATCAGG CGGCGGCGGT GGTCCGGG ^ 
GCGGGGAGAT AAGCCTGCCC ^^GAGGCAGG GGGCTGGGCT ^^^^^^ 
CCCCGCGCCT GACTTCGTTG GGGAGGGAGA CGCCCGGC ^^^^^ 
CTAGCCCAGC CGCGCGGAGC GCCTGGGAGA GGAGAAGGAG ^^^^^^^^^^ 
GAGATGGAGG CGACCGGCAC CTGGGCGCTG ^TG^l^ gGCCACCTGC 
CCTGCTGCTG ACGCTGGCGC ^GTCCGGGAC CAGGGCCCGA ^ ^^^^^^^ 
CCCCCGGGCC CACGCCGCTA GCACTGCTGG ^^AACCTC ^^^^^ 

CCCGGGGCGC tgtattcagg gctcatgcgg ctgagtaaga 

GGTGTTCACC ATCTACCTGG GACCCTGGCG ^CCTGTGGTG ^^^^^^ 

GGCAGGAGGC tgtgcgggag gccctgggag gtcaggctga ^^^^^^^ 
ggccggggaa ccgtagcgat gctggaaggg ^cttttga^ tttaccatgc 

651 TTTCTTCTCC AACGGGGAGC GGTGGAGGCA GCTGAGGAAG ^^^^^^^^^^ 
701 TTGCTCTGCG GGACCTGGGC ^TGGGGAAGC ^AGAA ^^^^gGACG 
751 CAGGCGGAGG CCCGGTGTCT GGTGGAGACA ^TCLAG ^^^^tCT 
801 CCCATTCGAT CCCTCCCTGC TGCTGGCCCA GGCCACCTC^ ..^^.tcCAG 
851 GCTCCCTCCT CTTTGGCCTC CGCTTCTCCT ^^G ^^^^ gCTCCCAGGG 
901 GCCGTGGTCC GGGCAGCTGG TGGTACCCTG ^^^^^^^ CTGCCAGGCC 
951 GGGTCAGACC TACGAGATGT TCTCCTGb CTTCACAGTC 
1001 CCCACAAGCA GCTCCTCCAC CACGTCAGCA gCCCCGCACG 
1051 CGGCAGGTGC AGCAGCACC^ ^f.^^^I^ GGCACAGGAG GAACAAAACC 
1101 TGACCTTGTC GATGCCTTCC TGCTGAA CATTTATTTG 
1151 CAGGCACAGA ATTCACCAAC A^^AACATGC ATACCCTCCT 
CTGTTTGCTG GGACGATGAC GGTCAGCACC ^CGGi^ gAGGAGCTGA 
GCTCCTGATG AAATACCCTC ATGTCCAAAA GTGGGTACGT ^^^^^^^^^ 
.TCGGGAGCT GGGGGCTGGC CAGGCACCAA ^CCTAG^^^^ ..cTGGCGCT 
CTCCCTTACA CCGACGCGGT TCTGCA1^«^ rrrrarCACC CGCTTCCGAG 
GGTGCCCATG GGAATACCCC GCACCCTCAT GCGGACCACC CG^^^^^^^^ 
1451 GGTACACCCT GCCCCAGGGC ACGGAGGTCT TCCCCC j^^^G 
1501 CTGCATGACC CCAACATCTT CAAGCACCCA ^AAGAG^^^^ ,,,cTGCCCT 
1551 TTTCCTGGAT GCAGATGGAC GGTTCAGGAA ^^^^^^^ ^aaagCGGAG 
TCTCCTTAGG GAAGCGTGTC TGL.Cii^^ rrCTTCTCCC TGGAGAGCCC 
CTCTTCCTCT TCTTCACCAC CATCCTACAA GCCTTCTCCC ^^^^^ 
GTGCCCGCCG GACACCCTGA GCCTCAAGCC CACCGTCAGT G ^^^^^ 
ACATTCCCCC AGCCTTCCAG CTGCAAGTCC GTCCCACTGA ^^^^^^ 
ACCACGCAGA CCAGATGAAG GAAGGCAACT ^GGAAGT^ artGTCTCCA 
GACGGTGCCT CCAGCCTCAA C^^^^^^^^^ SSgT TGTTTTCCGG 
GAGTGTACAC TGCAGGCAGC CACATllA^ acTCATGCTG CTAAGATGCA 
1951 AGTCTGTCCC ACGGCCCACA CGC^TG ACTCA^^^^ ^^^^^^^^^^^ 

2051 TAGCTTTCCA CAGACATAAA TATAGTCCAT CTGCAATC^^ ,,,gGAGTCT 

2101 CCAGGTAACC CACCAACTCC CCTGGATOlO ^ TCACAGCTCA 
2151 GGCTGTCACC TTCACAAGCC ACAGAAACGG CCACACATGT 

2201 CACGCCCTCT CCATTCATCG AACTTCTCAG TGTCCCTGTC ^^^^^^^^^ 

2251 GGCACAGGGA ACAGCATGCC CCCTCCGGGG TCATGC ^^^^^^^^^ 

CGCTGTCTAT GGCCCCAACT CATGCTCCCT CTO tGTCCACACA 

CCAGCCTGTG ACCACCGATG TCCACACACC CCCAACCAC ^^^^^^^^ 

gctacccacg tacgacatcg tcctggctcc ccagagt ^^g^^ 

GACACGCCGC CCCCACAGAG GCACAGTCCC CAGCCACCT^ ^.^^AAACAC 
GCCCTCAGTC ACCCCTTTTT AAGCACCCTG "TCTA ^TCA 
2551 ATCTGGGTCT GCGATTATGC ^^GAG^^T T^^^^^^^^^ ,;^CCACGTG 
2601 GACCGGAGGA ACACCTGCCC AACC-UOA^ -cATACACAC TCACTGATCT 
2651 GAAAGCGGCC CCTGCTGCCC CTCCACACAC ACATAC ^^^^^^^^ 
ACAGCCCCTG TTCGGCGTCA GAGTCCCCAC TAGA^__^^^^ ^^^.^TGAGCC 

AGAGACCAAG TAGGGGCCAG ^TTCLAAi rGGCTTGGGA ATTAAAGTTT x 



1201 
1251 
1301 
1351 
1401 



1601 
1651 
1701 
1751 
1801 
1851 
1901 



2301 
2351 
2401 
2451 
2501 



2701 
2751 
2801 
2851 
2901 



FEATURES: 

5'UTR: 1-303 
Start Codon:304 
Stop Codon; 1815 
3'UTR: 1818 
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HOMOLOGOUS PROTEINS: 

gi 1 117254 lsp|P24461|CPGl_RABIT CYTOCHROME P450 2G1 (CYPIIGl) (P.. 
gi|404777|gb|AAA31432,l| (L10912) cytochrome P-450 2B-Bx [Oryct.. 
gi|4799301pir| IS35666 cytochrome P450 2B4 isoform Bx - rabbit >.. 
gi|117212|sp|P001781CPB4_RABIT CYTOCHROME P450 2B4 (CYPIIB4) (P.. 
gi!320075|pirt IS31277 cytochrome P450 2B4-B1 - rabbit >gi|21369.. 
gi|47313501gb|AAD28466.11 {AF128849) cytochrome P450 2B10 relat.. 
gi|3123191!sp|P04167 1CPB2_RAT CYTOCHROME P450 2B2 (CYPIIB2) (P4.. 
gi|1172131sp|Pl2789|CPB5_RABIT CYTOCHROME P450 2B5 (CYPIIB5) (P.. 
gi |89973|pir M A27717 cytochrome P450 2B5, hepatic (form HPl) - .. 
gi 12144292 Ipirl I04RTP2 cytochrome P450 2B2 - rat 



ibTBEl48597|BE148597 MR0-HT024 l-150500-010-b02 HT0241 Homo sapi.. 
gb!BF359243|BF359243 RC6-ET0072-150600-011-F01 ET0072 Homo sapi.. 
gb|AW75377BlAW753778 RC1-CT0286-301099-01 1-f 08 CT0286 Homo sapi.. 

EXPRESSION INFORMATION FOR MODULATORY USE: 

gblBE148597| head_neck 
gb|BF3592431 lung_tumor 
gblAW753778| colon 

Tissue expression : 
Human leukocyte 



516 e-145 

513 e-144 

513 e-144 

508 e-143 

508 e-143 

508 e-143 

508 e-143 

507 e-142 

507 e-142 

506 e-142 



1091 0.0 
779 0.0 
652 0.0 
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1 MEATGTWALL LALALLLLLT LALSGTRARG HLPPGPTPLP LLGNLLQLRP 

51 GALYSGLMRL SKKYGPVFTI YLGPWRPVVV LVGQEAVREA LGGQAEEFSG 

101 RGTVAMLEGT FDGHGVFFSN GERWRQLRKF TMLALRDLGM GKREGEELIQ 

151 AEARCLVETF QGTEGRPFDP SLLLAQATSN VVCSLLFGLR FSYEDKEFQA 

201 VVRAAGGTLL GVSSQGGQTY EMFSWFLRPL PGPHKQLLHH VSTLAAFTVR 

251 QVQQHQGNLD ASGPARDLVD AFLLKMAQEE QNPGTEFTNK NMLMTVIYLL 

301 FAGTMTVSTT VGYTLLLLMK YPHVQKWVRE ELNRELGAGQ APSLGDRTRL 

351 PYTDAVLHEA QRLLALVPMG IPRTLMRTTR FRGYTLPQGT EVFPLLGSIL 

401 HDPNIFKHPE EFNPDRFLDA DGRFRKHEAF LPFSLGKRVC LGEGLAKAEL 

451 FLFFTTILQA FSLESPCPPD TLSLKPTVSG LFNIPPAFQL QVRPTDLHST 



FEATURES : 

E*unctional domains and key regions : 

Prosite search results: 

[1] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 6 



3 248-250 TVR 

4 288-290 TNK 

5 378-380 TTR 

6 473-475 SLK 



[3] PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase II phosphorylation site 

Number of matches: 3 

1 119-122 SNGE 

2 192-195 SYED 

3 343-346 SLGD 



[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 10 



1 


51 


-56 


GALYSG 


2 


109- 


114 


GTFDGH 


3 


115- 


120 


GVFFSN 


4 


188- 


193 


GLRFSY 


5 


207- 


212 


GTLLGV 


6 


257- 


2 62 


GNLDAS 


7 


284- 


289 


GTEFTN 


8 


339- 


344 


GQAPSL 


9 


370- 


375 


GIPRTL 


10 


444- 


449 


GLAKAE 



[5] PDOC00009 PS00009 AMIDATION 
Amidation site 

Number of matches: 2 

1 140-143 MGKR 

2 435-438 LGKR 



[6] PDOC00029 PS00029 LEUCINE_ZIPPER 
Leucine zipper pattern 

Number of matches: 2 

1 32-53 LPPGPTPLPLLGNLLQLRPGAL 

2 39-60 LPLLGNLLQLRPGALYSGLMRL 



501 




128-131 RKFT 



1 
2 



61-63 SKK 
99-101 SGR 
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[7] PDOC00081 PS00086 CYT0CHR0ME_P4 50 

Cytochrome P450 cysteine heme-iron ligand signature 

433-442 FSLGKRVCLG 

Membrane spanning structure and domains: 



Helix 


Begin 


End 


Score 


Certainty 


1 


5 


25 


2.305 


Certain 


2 


64 


84 


0.783 


Putative 


3 


170 


190 


1.041 


Certain 


4 


292 


312 


2.031 


Certain 


5 


448 


468 


1.133 


Certain 
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BLAST Alignment to Top Hit: ' 

>gi|117254|sp|P24461|CPGl_RABIT CYTOCHROME P450 2G1 (CYPIIGl) 
(P450-NMB) (OLFACTIVE) 
pir|!B31944 cytochrome P450 2G1 - rabbit 
Length = 494 

Score = 516 bits (1315), Expect = e-145 

Identities = 248/491 (50%), Positives = 345/491 (69%), Gaps = 3/491 (0%) 

Query: 1 MEATGTWALLLALAL-LLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMR 59 

ME G + + LAL LL+ +A + G LPPGPTP+P LGNLLQ+R A + ++ 
Sbjct: 1 MELGGAFTIFLALCFSCLLILIAWKRVQKPGRLPPGPTPIPFLGNLLQVRTDATFQSFLK 60 

Query: 60 LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFS '119 

L +KYGPVFT+Y+GP RPVV+L G EAV+EAL +A+EFSGRG +A +E F GHGV •+ 
Sbjct: 61 LREKYGPVFTVYMGP-RPWILCGHEAVKEALVDRADEFSGRGELASVERNFQGHGVALA 119 

Query: 120 NGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATS 179 

NGERWR LR+F++ LRD GMGKR EE IQ EA L+E F+ T+G P DP+ L++ S 
Sbjct: 120 NGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGYLLEEFRKTKGAPIDPTFFLSRTVS .179 

Query: 180 NVVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQTYEMFSWFLRPLPGPHKQLLH 239 

NV+ S++FG RF YEDK+F +++R + + +S+ Q Y+M+S ++ LPG H ++ + 
Sbjct: 180 NVISSVVFGSRFDYEDKQFLSLLRMINESFIEMSTPWAQLYDMYSGVMQYLPGRHNRIYY 239 

Query: 240 HVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNPGTEFTNKNMLMTVIYL 299 

+ L F +V+ ++ +LD P RD +D FL+KM Q++ NP TEF KN+++T + 
Sbjct: 240 LIEELKDFIAARVKVNEASLDPQNP-RDFIDCFLIKMHQDKNNPHTEFNLKNLVLTTLNL 298 

Query: 300 LFAGTMTVSTTVGYTLLLLMKYPHVQKWVREELNRELGAGQAPSLGDRTRLPYTDAVLHE 359 

FAGT TVS+T+ Y LL+MK+P VQ + EE+N+ +G + PS+ DR ++P+TDAV+HE 
Sbjct: 299 FFAGTETVSSTLRYGFLLIMKHPEVQTKIYEEINQVIGPHRIPSVDDRVKMPFTDAVIHE 358 

Query: 360 AQRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILHDPNIFKHPEEFNPDRFLD 419 

QRL +VPMG+P ++R T FRGY LP+GT+VFPLLGS+L DP F HP++F P FLD 
Sbjct: 359 IQRLTDIVPMGVPHNVIRDTHFRGYLLPKGTDVFPLLGSVLKDPKYFCHPDDFYPQHFLD 418 

Query: 420 ADGRFRKHEAFLPFSLGKRVCLGEGLAKAELFLFFTTILQAFSLESPCPPDTLSLKPTVS 479 

GRF+K+EAF+PFS GKR+CLGE +A+ ELFL+FT+ILQ FSL PP + + P +S 
Sbjct: 419 EQGRFKKNEAFVPFSSGKRICLGEAMARMELFLYFTSILQNFSLHPLVPPVNIDITPKIS 478 

Query: 480 GLFNIPPAFQL 490 
Sbjct: 479 GFGNIPPTYEL 489 
Hmmer search results (Pfam) : 

Scores for sequence family classification (score includes all domains) : 
Model Description Score E-value N 



PF00067 Cytochrome P450 594.4 6.9e-175 1 

Parsed for domains : 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



PF00067 1/1 33 493 .. 1 497 [] 594.4 6.9e-175 
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1 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1901 NNNNNNNNNN NNNNNNNNNN NNNNNTGACA GGGGCCATGA TGGAGACACC 
1951 TTGGATCGAA GAGGTCACAG CACCCTCCTC TTTCTTCCTC CCTACCCCCA 
2001 GCTGAGTAAG AAGTACGGAC CGGTGTTCAC CATCTACCTG GGACCCTGGC 
2051 GGCCTGTGGT GGTCCTGGTT GGGCAGGAGG CTGTGCGGGA GGCCCTGGGA 
2101 GGTCAGGCTG AGGAGTTCAG CGGCCGGGGA ACCGTAGCGA TGCTGGAAGG 
2151 GACTTTTGAT GGCCATGGTA AGTCAAGGGC TGCTAGGCCC TCCGCTCACA 
2201 GCCTGCCACC ACTTACTGGT GTGTGACCTT TGCACATGGC TTAGTCCCTC 
2251 TGTTGCCTCA TCTGTCAAAT GGAGTGATAA CAGTGCCCAT CAGCCGGGTG 
2301 CAGTGGCTAG TGCCTGAAAT CCCAACACTT TGGGAGGCGG AGGTGGGTGG 
2351 ATCACTTGAG GTCAGGAGTT CGAGACCAGC CTGGCCAACA TGGTGAAACC 
2401 CTGTCTCTAC TAAAAATATA AAAATTAGCT GGGCATGGTG GTGCGTACCT 
2451 GTAATCCCAG ATACTTGGGA GGTTGAGGCA GGAGAATCGC TTGAACCCGG 
2501 GAGGCAGATG TTGCAGTGAA CCAAGACTGT GCCACTGCAC TCCAGTCTGG 
2551 GCAACAGAGT GAGCCTCCAT CTCAAACAAA CAAACAAAAA GCAGTGCCCA 
2601 TCATGTAGGA TTGAGTGATT GAGTGAGGAC TGAGCCTTGT GCAAAGTGAG 
2 651 CACTCACTAA TCACCAGGTT GTAGTATCAG TGATAACCAT CAATGATCCA 
2701 GGTAAAGCCC TGAGGGTTCA GAAAGATGCC GGAGCGCTTT CAAGGTGCTG 
2751 GGGATTGGTG GGCAAGCCCT CGAATAATAG AAACAGTTCT CTGTATTACA 
2801 ACAGAAAGCA GGAGGCCCAT GCTGGGTGCT GCCAGGAACT CAGTAGTAAC 
2851 TAAGACAGCA CCGGTGCTGC TTCCCCAGCG CACCTAGGCC AGTGGGGAAA 
2901 CAGACTCACC ACACAGTCCC AGCCCAGAGT GGTCAGGGCC AAGATGGGGA 
2951 AGCACGGGGA GAAAGGTCAG GGTGGGATGG GGAGGGGTCA GGGCAAGAGG 
3001 GGTCAGGGCC AGGCTGAGGG AAGCCCTGGG ACTGTAGGAA TTTAGAGGAG 
3051 GTACCTGACC CGGCATGTTT GGTGAGGGAG ATTCAGGAAG TCTTCCTGGA 
3101 AGAGAGGCTG TCGGAGCTGA GACTCATAAG ATGAGTGGGG AGGGTGTTCC 
3151 AGGCAGAAAG ACCAGCACCT ACAAAAGCAT GACTTTGAGA GAAGCATTCA 
3201 TCCATTCAAC TGATGAATTT TCAGACTGGG CACGCTGGCT CATGCCTGTA 
3251 ATCCCAGCAC TTTGGAAGGC TGAATGGGGA GGATGACTTG AGCCTAGGCA 
3301 TTTGTGACAA GCCTGGGCAA CATGGTGAGA CCCTGCCTCC ACAAAACAAA 
3351 CAAACAAACA AAAAATCATT ATACCTGGTA CCATGGGTAC CAGGTACATA 
3401 GAAATGACTC AGGCAGATAT GGTGTCCTCT CCTACTGTGG GAGAGGCGGG 
3451 CTTATACTGC AGTAAGACAA TAGAGGGAGG GAATATAATC CTAAAATGAG 
3501 AGGTACAGAT TTGAGAGCAA ACACAGGGCA CAGGCATATG TACGAGGGTA 
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3551 AAGAGGGAAT CAGGGAAGGC TTCTCAGAGA AGGTGACATT TAAGCCGGGA 
3601 CATGAAGGAT GAACGAGTTA GTTCACCAAG GATGGGATGG AAAGGGGTGA 
3651 GAGTGATGGA GGCAGAGGGA ACTGCAGGAT CATAGGCCTA GACAGGGGAT 
3701 CCTGACGCCC TTGAGGAAGT GAGAGAAGAC CAGCGCAGTC GTAGTGGGTT 
37 51 AAGTAACAAA GCTGAGAAGC CAGGGAAATC CCTGGTCATG CAGGGCCTGT 
3801 GAGTCACGTC AGAGTGTTTG GGCTTTTGTT TTTCCTGGGA GCAGTCGATT 
3851 TTAAGCAGGG AACAGCTGTA TTCAGAGTTG GGAAGATCCT GTGGTTGCTG 
3901 CCTGAAGGGG ATGAAACTGG AGGCTAGGAG CCCAGGGTGA TAGGGAGGAT 
3951 CCAGGGTGAT GGGGAGGCTG GGAGGTCCGC GGTGATGGAC CAGGGCTGGG 
4 001 GCCAGGGGAT GGGGAGGAAG GAGTAATTGG GAGAGGCCTG GGGCTCTGGC 
4051 CGAGGAATGG ATGGTGGGCT GAAACAGGGA GAGGAGAGAT GCTTAGGCCA 
4101 CTTTGGAACA CAGTAGGGCA AGGACAGGAG ACACCCAAGG GGAAGTGCCC 
4151 AAGAGACCAC GACAGGCTGG CATTGGACAG GGAAGGTCTG TCTGGAGCAG 
4201 GTGTCTTGGA TAAGGGAGGA AAATGGTGCA GTTCCATCCT CCTCCCTCTC 
4251 TGTTCAACCT CTAAACTACA TGGGGCACAG GACCCAGTGG GACTCCATAA 
4301 ATGATGGGAT GGGTGGATGG AAGGAAGGAA GGAGGAAACA ACTCTTCATT 
4351 CATCCTGGTT ATTTACAGAA CAGGCCAGGT GCGGTGCTCA CGCTTGCCAT 
4401 TCTAGCACTT TGGGAGGCTG AGGTGGGTGG ATTACCTCAG GTCAGGAGTT 
4451 CAAGACCAGC CTAGACAACG TAGAGAAACC CCATCTCTAC TGAAGATATA 
4 501 AAATTAGCTG GGCGTAGTGG CATATGCCTG TAATCCCAGC TAGTCGGGAA 
4551 GCTGAGGCAG GAGAATCGCT TGAACCCGAG AGGCAGAGGT TGCGGTGAGC 
4 601 TGAGATCGTG CCATTGCACT CCAGCCTGGG TGACAAAGCA AGACCTCGTC 
4 651 TCAATAATAA TAATAATTAC AAAACAGAAG GAGCCTGGGT CATCCCAGCT 
4701 ACCTACTTTT CAGGAGAATG TACTCCCTTA CCCAAGGGCA AAGGATGGGA 
4751 GAACCAGTTT GATTATGCAT TTATTGAGCA CCTACTGAGT CCTCATCCCT 
4801 GGGCTAGGCT GGAATGGACT CAGATGGAGC CTGAAGAGTC CCCCTCAGGG 
4851 AACCTCACTA GAAAGAAGGA GGAATCGGCC GGGCGCGGTG GCTCACGCCT 
4 901 GTAATCCCAA CACTTTGGGA GGCTGAGGTG GGTGGATCAC AAGGTCAGGA 
4951 GATCGAGACC ATCCTGGCTA ACACAGTGAA ACCCCATCTC TACTAAAAAT 
5001 ACAAAAAATG AGCCAGGCAT GGTGGCGGGC GCCTGTAGTC CCAGCTACTC 
5051 AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCA GAGGTTGCAG 
5101 TGAGACGAGA TCACGCCACT GCACTCCAGC CTGGGCAACA GAGCGAGATT 
5151 CCGTCTCAAA AAAAAAAAGA AAGAAAGGAA GAAGGGGGAA TGGGGGAGAG 
5201 GGGCCGGTCC CTTTTTGAGT CTAGCCTTCT GCGCAGGGGT TTTCTTCTCC 
5251 AACGGGGAGC GGTGGAGGCA GCTGAGGAAG TTTACCATGC TTGCTCTGCG 
5301 GGACCTGGGC ATGGGGAAGC GAGAAGGCGA GGAGCTGATC CAGGCGGAGG 
5351 CCCGGTGTCT GGTGGAGACA TTCCAGGGGA CAGAAGGTCA GCATGGCGGG 
5401 GTCACCCCAG GGTCTCCAGC CGAGTGAAAG GGAAAACTCT CCTACTGTGG 
5451 CTGGGGGTGG CCCCAACCCA GGTCCTGGAA TGGGCAGGAG GGGAAGCCTT 
5501 GAACTCTAGG GCTGGCCTGG GGGTTCTGTT CACTGCCACC TTCTGTCTCT 
5551 GTCCCACTGT CTCTCCGAGG CTGTCATGAC ATCTCTCTGT GTGTCTCTGG 
5601 TGCTATCATC CCATTCTTCC TGGGTCTCCA TCTCTCTCTC TGTCTCTTTT 
5651 CTTTCTCTCT CCTTTCCTCT ATTTTTTGGG CCCTCAGTCT ATCTCTGTTT 
5701 CTGTCTCCCT GTCTGTGTGA TGGTCACTCT GTTTCTTTCT CCCTGTCTGT 
5751 TTCTCTGTCC CTATCTGTCT GTATCCTTCT TTGCCTGTTT AGCTCTCTCC 
5801 CTGCGCTGTC CATCCATCTT TCCCTGCCTC CCTGTCTCTC TCTGGTTGGG 
5851 TTCAGCCCCA ACCTGCTCCC CTCTGCCTGG CTCCATCACA GCCTACCTCC 
5901 CTGCCCCCAT TCCCCCCAGG ACGCCCATTC GATCCCTCCC TGCTGCTGGC 
5951 CCAGGCCACC TCCAACGTAG TCTGCTCCCT CCTCTTTGGC CTCCGCTTCT 
6001 CCTATGAGGA TAAGGAGTTC CAGGCCGTGG TCCGGGCAGC TGGTGGTACC 
6051 CTGCTGGGAG TCAGCTCCCA GGGGGGTCAG GTGAGTGGGT GGGACCCCTC 
6101 TCCAACTACC TTCCCTGAAG GTTCCTGCCA AGGTCCCATG AGAACTAGCT 
6151 GCCCTTCTCC CCACAGACCT ACGAGATGTT CTCCTGGTTC CTGCGGCCCC 
6201 TGCCAGGCCC CCACAAGCAG CTCCTCCACC ACGTCAGCAC CTTGGCTGCC 
6251 TTCACAGTCC GGCAGGTGCA GCAGCACCAG GGGAACCTGG ATGCTTCGGG 
6301 CCCCGCACGT GACCTTGTCG ATGCCTTCCT GCTGAAGATG GCACAGGTGT 
6351 GGGAAGGGTG CAGGGACCCC CTCTCTGAAT GGGCGTGGTG ACCTGGCAGG 
6401 TCCCAGCCAG GTGTCCCTGG GGACCTCAAT TGGGTTCCTC TCTCTTTCTC 
6451 TCTCTGCATG TCTCTGTGAG TATGAGTGTC TCTGTGCATG TGTGTGCATC 
6501 CCTTCTCTGC ACATCTGTGC TGGCCCTTTC AGGGCGTTGC TCTCACTGCC 
6551 TCTCCCGCCC CCGACCTGGG CATTTGTGCC GGGCTGTCTG TCTCTCCAGC 
6601 ATCTCTCCTC TTTCTCCCTC CCACCTCGGC CCTTGTGTTC AGGCCCCATG 
6651 CCCAGGGTCC TACACCAGCA ATCCCCAGGA TCACTTCATC CCATCCCCTG 
6701 CAGCCTCCCC AGACTTTTAT GTAAATTCAC AATTTTATGT GAATTATGGT 
6751 CATTTATTAG GAAGCCTTGC AATATCAAGT TATGTTAATA AAGTCCACTT 
6801 TATTAATTAT ATAAGAACAA TATTTCTTTT CCTTTTTTTT TTCTTTTCTT 
6851 TTTAAAGAGA CAGGATCTCT TTCTGTTGCC CAGGCTAGAG TACAGTTGCA 
6901 AAATCATAGC TCACTGCAAC CTTGAACTCC TGGGCTCAAG CAATCCTCCT 
6951 GCCTCGGGCT CCTGAGTAGC TGGGACAACA GGTGTGCACC ACCACACCTG 
7001 GCTAAATTTT TTTTTTTTCT TTGTAGAGAT AGACTCTCAC TATGTTACCC 
7 051 AGGCTGGTCT TGAATTCCTG GGCTCATGTA ATCCTCCTGC TGCCTTGAAC 
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7101 TCCCAAAGTG CTGGGACTAT AGGCATAAGA CATCATGCCC GGTCGGGCAC 
7151 AGTGGCTCAT GCCTGTAATC TCAGGACTTT GGGAGGCCGA GACGGGCGGA 
7201 TCACCTGAGG TCGGGAGTTC GAGACCAGCC TGACCAACAT GGAGAAACCC 
7251 CATCTCTACT AAAAAAAAAA ATACAAAATT AGCCGGACGT GGTGGCACAT 
7 301 GCCTGTAATC CCAGCTACTA GGGAGGCTGA GGCAGGAGAA TCGCTTGAAC 
7351 CCGGGAGGCT TAGGTTGCGG TGAGCTGAGA TTGCACCATT GCACTCCAGC 
74 01 CTGGGCAACA AGAGCGAAAT TCCATCTCAA AAAAAAAAAA AAAGAAAAAA 
7451 AGAAAAAAGA CACCATGCCC TATAAGTAAA CTAGAATTAA GGTGACTCCT 
7501 AAGGAAATAA ATAGTTTTTA ACTGTACGAA CTTTTGGAAG AATGGGGCCA 
7 551 ATTCTTTAAT TAAATGCAGC CTCCCTGTTT GTGGAGAAAG AAAAATTTTT 
7 601 CTTAACCCTA TTGCCCCATT TCTTTTCTCT TTTATTGAAT ATTTTTTAGT 
7651 TTTAACTATA GTAAAATACA CATAACGTTT ACCATCTTAA CCATTTTTAG 
7701 GTATACAGTA CAGTAGTGTT CAGTACATTC ATACTGTTAT GCAATCAGTC 
77 51 TCCAGAACTC TTCATGTTGC AAAGCTGAAA CTCTATACCC ATTAAACAAC 
7801 TGCCTGTTCC TCCCTCCTCC AACCCCTGGC AATCACCTTT TTTTTTTTGA 
7851 GACGAAGTCT CACTCTGTCA CCCAGGCTAG AGTGCGGTGG CTCGATCTCG 
7901 GCTCACTGCA AGCTCCGCCT CCCGGGTTCA TGCCATTCTC CTGCCTCAGT 
7951 CTCCCAAGCA GCTGGGACTA CAGGTGCCCG TCACCACGCC TGGCTAATTT 
8001 TTTGTATTTT TAGTAGAGAT GGAGTTTCAT CGTGTTAGCC AGGCTGATCT 
8051 CAAACTCCTG GCCTCAAGTG ATCCACCCGC CTCGGCCTCC CAAAGTGCTG 
8101 GGACTACAGG CGTGAGCCAC TGTGCCTGGC CAGGAAGTAG ACTCTTGATA 
8151 TTAGTTCTCT CTGGTTGAAA TGTTTTTAAA AATGAAAGAG AATGACTAAT 
8201 AACAAAAACA CAGAAAGTTA TAAGGATTGA TGAAGATGTG GAGACTTTGA 
8251 AACCCATGTA TACCATTGGT GGGAATGTGA AACGACGCAG CCCTGTGGAA 
8301 AATGGTACAG CAGTTACCTG AGGTCAGGAG TTTGAAACCA ACCTGGCCAA 
8351 CATGCAGAAA CCCCGTCTCC ATTAAATGTA CAAAAATTAG CCAGGNNNNN 
8401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
84 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
3151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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10651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

114 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12601 NNNNNNNNNN NNNNNNNNNN NNNNNNCTCT TTCTTCCTTT CTTCCTTCCT 

12651 CCCTTCCTCC CTTCCTCCCT TCCTGTCTTC CTCTCTTTCT CTCTTTCTTT 

12701 CTTGACAGGG TCTCTCTTTG TCTCCCAGGC TGGGGTGCAG TGGTACAAGC 

12751 ATAGCTCACA GCAGCCTTGA ACTCCTAGGC TCAAGTGATC CTCCCACGTC 

12801 AGCCTCCTGA GCAGCTGGGA CAACGGGCTC ATACCACCAT GCCTGGCTAA 

12851 TTTTTTAATT TTTCGTAGAG ACAAGGTCTT GTTATATTGC CCAGGCTGGT 

12901 CTCAAACTCC TGGGCTCAAA TGCTTCTCTC ACCTCAGCCT CCCACGTGGC 

12951 TGGGATTACA GGCATGAGCC ACTGCACGCC ACTCAACACT CCACAAATGT 

13001 TGATGCCATT ATGTTTTGTG AACTAGTGTC CCTGGCACCC GAGACTTGTA 

13051 CTCCACACTC GAGGACCAAA TAGACTGGGG TGGGAAGGGG TTTATAGTTT 

13101 CATTATTATT TCCCCTCAGG GCACGGAGGT CTTCCCCCTC CTTGGCTCCA 

13151 TCCTGCATGA CCCCAACATC TTCAAGCACC CAGAAGAGTT CAACCCAGAC 

13201 CGTTTCCTGG ATGCAGATGG ACGGTTCAGG AAGCATGAGG CGTTCCTGCC 

13251 CTTCTCCTTA GGTATCTGCT GCAGCCCTGG GTATCACAAG CAGGTGCTGG 

13301 CGAACTCCAG GCATCTGTGC CAGCTGGGGG CACCCTTCTG CACCCTGGGC 

13351 TTACTGTTGG CTCCTCCACC TGCTGTTCCC CCCGTGGGCC TGGGTGTGAG 

13401 GAATACTGAC TCAGCCCTCT CTCTCTCTCT CTCCTCACCA GGGAAGCGTG 

13451 TCTGCCTTGG AGAGGGCCTG GCAAAAGCGG AGCTCTTCCT CTTCTTCACC 

13501 ACCATCCTAC AAGCCTTCTC CCTGGAGAGC CCGTGCCCGC CGGTACACCC 

13551 TGAGCCTCAA GCCCACCGTC AGTGGCCTTT TCAACATTCC CCCAGCCTTC 

13601 CAGCTGCAAG TCCGTCCCAC TGACCTTCAC TCCACCACGC AGACCAGATG 

13651 AAGGAAGGCA ACTTGGAAGT GGTGGGTGCC CAGGACGGTG CCTCCAGCCT 

13701 CAACAGTGGG CATGGACAGG GTTAATGTCT CCAGAGTGTA CACTGCAGGC 

13751 AGCCACATTT ACACGCCTGC AGTTGTTTTC CGGAGTCTGT CCCACGGCCC 

13801 ACACGCTCAC TTGACTCATG CTGCTAAGAT GCACAACCGC ACACCCATAC 

13851 ACAACTACAA GGGCCACAAA GCAACTGCTG GGTTAGCTTT CCACAGACAT 

13901 AAATATAGTC CATCTGCAAT CACAAGCACA TAGCCAGGTA ACCCACCAAC 

13951 TCCCCTGGAT CTGCAGCCCA CACGTGGGAG TCTGGCTGTC ACCTTCACAA 

14001 GCCACAGAAA CGGCCACACA TGTTCACAGC TCACACGCCC TCTCCATTCA 

14051 TCGAACTTCT CAGTGTCCCT GTCCCTGGTG CCTGGCACAG GGAACAGCAT 

14101 GCCCCCTCCG GGGTCATGCC ACCCAGAGAC TGTCGCTGTC TATGGCCCCA 

14151 ACTCATGCTC CCTCTCTTGG CTACACCACT CTCCCAGCCT GTGACCACCG 




14201 ATGTCCACAC ACCCCCAACC ACTTGTCCAC ACAGCTACCC ACGTACGACA 
14251 TCGTCCTGGC TCCCCAGAGT ATCTTCCCAC TGAGACACGC CGCCCCCACA 
14 301 GAGGCACAGT CCCCAGCCAC CTCTGCAACT GCAGCCCTCA GTCACCCCTT 
14351 TTTAAGCACC CTGATTCTAC CAAATGCAAA CACATCTGGG TCTGCGATTA 
14 401 TGCACAGAGA CTTTGGACAT ACGAGGACCC TCAGACCGGA GGAACACCTG 
14451 CCCAACCCCA ACACGTGCTT ATGTAACCAC GTGGAAAGCG GCCCCTGCTG 
14 501 CCCCTCCACA CACACATACA CACTCACTGA TCTACAGCCC CTGTTCGGCG 
14551 TCAGAGTCCC CACTAGACCC AGTGGAAGGG GTTAGAGACC AAGTAGGGGC 
14 601 CAGTTTCCAA TTCACCCTGT CAGGGAGTGA GCCGGATCTG ACGTTCCTTG 
14 651 TGACTTAAGG GTCCGGCTTG GGAATTAAAG TTTGTTTCTG GCCTTTAGCC 
14701 TACTGCGTGT GTGACCCGTG TCAGTCACTG TGAGTAAGGG GTGGGGACAG 
14751 GGGAGTCCAC CCCTCCCCTG AGGCTGGGCG GGAGCTGAAA AACATGGCCA 
14801 CCGCCCACCC TGGCTGTTGA CATCAGGACC AGATGTGGAG CTGGGAGGAG 
14851 GGGCAGGGCT GGTGACGCCC TGGGCCTCAT TTCCAAAAAG GGCCAAGGTG 
14 901 TCCGGCGGTG GGAAGTGGGC AAGGAGGGGG TAACCCAAGC TGGACTGTGG 
14 951 ACCTTGGGGG CTTCCTCAGC CAGGGAGAGC CTGAAGCCAA CTAGATCCAG 
15001 ACCCTAGAGA CTCTTCAAAC TTGAGTACAG GAACTAGCTT GCAACACAGA 
15051 CTCTAAGCCC ACTCCCATTT CTTCCACCCT TTTTCTCTTG CCTCCCCTTC 
15101 ACAAGGAAAC CAGAGGCATT TGTAAATTTC CTTTCTTTTT tTTTTTTTTT 
15151 TTTTTTTTGA GACGGAGTCT CACTCTGTCA CCCAGGCTGG AGTGCAGTGG 
15201 TGTGATCTTG GCTCACTGCA GCCTCCGCCT CCGGGTTCAA GCCATTCTCC 
15251 TGCCTCAGCC TCCCAAGTAG CTGGGATTAC AGGTGTGTGC CACCACGCCC 
15301 AGCTAATTTT TGTATTTTTA GTAGAGATGG GGTTTCACCA TGTTGGCCAG 
15351 GCTGGTCTCG AACTCCTGAC CTCAGATGAT CTGCCAGTCT CGGCCTCCCA 
15401 AAATGCTGGG ATTACAGGCG TGAGTCGCTA CTAGATAAAT TTCTTATCTA 
15451 GCAAAGAAGT TTGCAAACAT ACGCAAAAGT AGAAAGATAC AATGAGCCCC 
15501 CAGGTGCCCA TCACCCAGCC TCATTTCAAT AGTCATCAAC TTTCTGCAGC 
15551 TTTTACTTCA TCTATATCCT TTTCTGCCTC TTTTTTTTTT TTTTTATTTT 
15601 GAGATAGGGT TTTGCTTTGT TGCCCAAGCT GGGGTGCAGT AGCATGATCT 
15651 CATAGTTCAC TGGGGCTTCA GACTCCTAGG CTCAAGTGAT CCTCCCGCCT 
15701 CGGCCTCCAA GCAGCTGGGA CTACAGATGC GTGCCACCAC ACCCAGCTAA 
15751 ATTTCTTATT TTTATTTTCT ATAGAGAAAG TCTCACTATA CAGCCCTGTG 
15801 CTGGTCTCAA ATTCCAGGCC TCAAGAGTTT CCATCCCAGC CTCCCAAAGT 
15851 GCTGGGATTA TAGGCGTGAG TCACTGCACC CTGCCCTAAT ATTTTTATTT 
15901 TATCTATTGC TTTTTATTTA CTTATTTATC TTTTATTTTT GAGACAGAGT 
15951 CTCACTCTGT GGCCCATGCT GGAGTGCAGT GGCATCATCT CGGCTCACTG 
16001 TAACCTCCGC CTCTTAGGTT CAAGCAGTTC TCCTGCCTTG ACCTCCCGAG 
16051 TAGCTGGAAT TACAGGTGCC TGCCACCAAG CCTGGCTAAT TTTTTTATTT 
16101 GTAGTAGAGA TGGGGTTTTG CCATGTTGAC CAGGCTGGTC TCGAACTCCT 
16151 GACCTCAGGT GATCTGCCCA CCTTGGCCTC CCAAAGTGCT GAGATTACTG 
16201 GTATGAGCCA CCGTGCCTGG CCACCTATTG CTTTTTAAAG ATTATTTTTT 
16251 TATTATTATT ATTTTTTTAT TTGCAGATGG AGTTTCGCTA TTGTTGCCCA 
16301 GGCTGGAGTG CAATGGCGTG ATCTCAGCTC ACCGCAACCT CCGCCTCCCA 
16351 GGTTCAAGCG ATTCTCCTGC CTCAGCATCC CTAGTAGCTG GGATTACAGG 
16401 CATGCACCAC CATGTCCAGC TAATTTTGTA TTTTTAGTAG AGACGAGGTT 
16451 TCTCCAGGTT GGTCAGGCTG GTCTCAAACT CCCAACCTCA GGTGATCCGC 
16501 CCATCTCGGC CTCCCAAAGT GCTGGGATTA CAGGTGTGAG CCACCGCGCC 
16551 TGGCCTTAAA GATTATTTTA AGGCAAATTA CAGAAAGCAA TTTAATGCAC 
16601 ATTTCTGAGA GTTAAAGATA TTTTTGCCCT TGACATTTTA TGAGGACAGT 
16651 TTTCAAACAT GCAGCAAAGT TGAGGGAATT GTACAAGGAA CACCTTGTGC 
16701 ACTCCTGCCT CAGTCTCCCA AGCAGCTGGG ACTACAGGTG CCCGTCACCA 
16751 CGCCTGGCTA ATTTTTTGTA TTTTTAGTAG AGATGGAGTT TCATCGTGTT 
16801 AGCCAGGCTG ATCTCAAACT CCTGGCCTCA AGTGATCCAC CCGCCTCGGC 
16851 CTCCCAAGTG CTGGGACTAC AGGCGTGAGC CACTGTGCCT GGCCAGGTAA 
16901 GTAGACTCTT GATATTAGTT CTCTCTGGTT GAAATGTTTT TAAAAATGAA 
16951 AGAGAATGAC TAATAACAAA AACACAGAAA GTTATAAGGA TTGATGAAGA 
17001 TGTGGAGACT TTGAAACCCA TGTATACCAT TGGTGGGAAT GTGAAACGAC 
17 051 GCAGCCCTGT GGAAAATGGT ACAGCAGTTA CCTGAGGTCA GGAGTTTGAA 
17101 ACCAACCTGG CCAACATGCA GAAACCCCGT CTCCATTAAA TGTACAAAAA 
17151 TTAGCCAGGC ATGGTGGTGC GCACCTGTAA TCCCAGCTAC TCGGGAGGCT 
17201 GAGGCAGGAG AATTGCTTGA ACCCAGGAGG CGGAGGTTGC AGTGAGCCGA 
17251 GATCGTGCCA CTGCACTCAG CCTGGGCAAC AAAGCAAGAC TCTGTCTCAA 
17301 AAAAAAAAAG TCTACTTCCC AACCTTCCCA AAAATTTATC TAAACCCCGT 
17351 GACAAAACTT TAACTTGTGT TTCCGACCCC AGGCTTGGCT GTTCTGGACA 
17401 TTTACTTCCC AAAGGCTGTG TTCTCTCAGC CCCTCTGCCT GGTTTCTTTC 
17451 AGGAGGAACA AAACCCAGGC ACAGAATTCA CCAACAAGAA CATGCTGATG 
17501 ACAGTCATTT ATTTGCTGTT TGCTGGGACG ATGACGGTCA GCACCACGGT 
17551 CGGCTATACC CTCCTGCTCC TGATGAAATA CCCTCATGTC CAAAGTAAGA 
17601 GCCTTTTCCA CTTGCCAGGC CTTGGGAACA GAAGTCAGGG TTCTAGGCTG 
17 651 AGCAAGGTGG CTCACGCCTA TAATCCCAGC ACTTTGGGAG GCTGAGGCGG 
177 01 GCTGATCACT TGAGAATAGG AGTTTAAGAC CAGCCGGCCA ACACAGTGAA 
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FEATURES: 










Start: 




1999 




6349 


Exon : 




1999 




2167 


Intron : 




2168 




5236 


Exon: 




5237 




5386 


Intron : 




5387 




5919 


Exon : 




5920 




6080 


Intron : 




6081 




6166 


Exon: 




6167 




6349 


Intron : 




6354 




1 T R /I O 


Exon: 




6354 




6636 


Intron : 




6637 




13087 


Exon : 




13088 




13261 


Intron : 




13262 




13441 


Exon: 




13442 




13542 


Intron : 




13547 




13648 


Exon : 




13547 




13648 


Stop : 




13649 






SNPs: 










DNA 










Position 


Major 


Minor 


2226 


G 




C 


A 


2226 


C 




A 


G T 


3081 


A 




T 


G C 


3788 


A 




T 




3979 


T 




C 


G 


5056 


T 




C 


G 


5213 


T 




G 


A C 


5508 


A 




C 


T G 


5857 


C 




G 


A T 


6385 


C 




T 


G 


6813 


c 




T 


A 


7853 


c 




T 




12973 


G 




C 


T 


12973 


A 




G 


T 


13012 


G 




A 


C T 


13072 


A 




T 


C G 


13370 


C 




G 




13682 


C 




T 


A 


14631 


c 




T 


G 



Context : 

DNA 

Position 

222 6 TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 
TCCTCCCTACCCCCAGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTACCTGGGACC 
CTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGCCCTGGGAGGTCA 
GGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTTGATGGCCA 
TGGTAAGTCAAGGGCTGCTAGGCCCTCCGCTCACAGCCTGCCACCACTTACTGGTGTGTG 
tG,C,A} 

CCTTTGCACATGGCTTAGTCCCTCTGTTGCCTCATCTGTCAAATGGAGTGATAACAGTGC 
CCATCAGCCGGGTGCAGTGGCTAGTGCCTGAAATCCCAACACTTTGGGAGGCGGAGGTGG 
GTGGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 
CTACTAAAAATATAAAAATTAGCTGGGCATGGTGGTGCGTACCTGTAATCCCAGATACTT 
GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGATGTTGCAGTGAACCAAGA 

2226 TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 
TCCTCCCTACCCCCAGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTACCTGGGACC 
CTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGCCCTGGGAGGTCA 
GGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTTGATGGCCA 
TGGTAAGTCAAGGGCTGCTAGGCCCTCCGCTCACAGCCTGCCACCACTTACTGGTGTGTG 
[C, A,GsT] 
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CCTTTGCACATGGCTTAGTCCCTCTGTTGCCTCATCTGTCAAATGGAGTGATAACAGTGC 
CCATCAGCCGGGTGCAGTGGCTAGTGCCTGAAATCCCAACACTTTGGGAGGCGGAGGTGG 
GTGGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 
CTACTAAAAATATAAAAATTAGCTGGGCATGGTGGTGCGTACCTGTAATCCCAGATACTT 
GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGATGTTGCAGTGAACCAAGA 

3081 AAACAGTTCTCTGTATTACAACAGAAAGCAGGAGGCCCATGCTGGGTGCTGCCAGGAACT 
CAGTAGTAACTAAGACAGCACCGGTGCTGCTTCCCCAGCGCACCTAGGCCAGTGGGGAAA 
CAGACTCACCACACAGTCCCAGCCCAGAGTGGTCAGGGCCAAGATGGGGAAGCACGGGGA 
GAAAGGTCAGGGTGGGATGGGGAGGGGTCAGGGCAAGAGGGGTCAGGGCCAGGCTGAGGG 
AAGCCCTGGGACTGTAGGAATTTAGAGGAGGTACCTGACCCGGCATGTTTGGTGAGGGAG 
[A,T,GsC] 

TTCAGGAAGTCTTCCTGGAAGAGAGGCTGTCGGAGCTGAGACTCATAAGATGAGTGGGGA 
GGGTGTTCCAGGCAGAAAGACCAGCACCTACAAAAGCATGACTTTGAGAGAAGCATTCAT 
CCATTCAACTGATGAATTTTCAGACTGGGCACGCTGGCTCATGCCTGT7VATCCCAGCACT 
TTGGAAGGCTGAATGGGGAGGATGACTTGAGCCTAGGCATTTGTGACAAGCCTGGGCAAC 
ATGGTGAGACCCTGCCTCCACAAAACAAACAAACAAACAAAAAATCATTATACCTGGTAC 

3788 ATCCTAAAATGAGAGGTACAGATTTGAGAGCAAACACAGGGCACAGGCATATGTACGAGG 
GTAAAGAGGGAATCAGGGAAGGCTTCTCAGAGAAGGTGACATTTAAGCCGGGACATGAAG 
GATGAACGAGTTAGTTCACCAAGGATGGGATGGAAAGGGGTGAGAGTGATGGAGGCAGAG 
GGAACTGCAGGATCATAGGCCTAGACAGGGGATCCTGACGCCCTTGAGGAAGTGAGAGAA 
GACCAGCGCAGTCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTC 
tA,T] 

TGCAGGGCCTGTGAGTCACGTCAGAGTGTTTGGGCTTTTGTTTTTCCTGGGAGCAGTCGA 
TTTTAAGCAGGGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGG 
GGATGAAACTGGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGC 
TGGGAGGTCCGCGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATT 
GGGAGAGGCCTGGGGCTCTGGCCGAGGAATGGATGGTGGGCTGAAACAGGGAGAGGAGAG 

3979 ATCATAGGCCTAGACAGGGGATCCTGACGCCCTTGAGGAAGTGAGAGAAGACCAGCGCAG 
TCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTCATGCAGGGCCT 
GTGAGTCACGTCAGAGTGTTTGGGCTTTTGTTTTTCCTGGGAGCAGTCGATTTTAAGCAG 
GGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGGGGATGAAACT 
GGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGCTGGGAGGTCC 
[T,C,G] 

CGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATTGGGAGAGGCCT 
GGGGCTCTGGCCGAGGAATGGATGGTGGGCTGAAACAGGGAGAGGAGAGATGCTTAGGCC 
ACTTTGGAACACAGTAGGGCAAGGACAGGAGACACCCAAGGGGAAGTGCCCAAGAGACCA 
CGACAGGCTGGCATTGGACAGGGAAGGTCTGTCTGGAGCAGGTGTCTTGGATAAGGGAGG 
AAAATGGTGCAGTTCCATCCTCCTCCCTCTCTGTTCAACCTCTAAACTACATGGGGCACA 

5056 AGTTTGATTATGCATTTATTGAGCACCTACTGAGTCCTCATCCCTGGGCTAGGCTGGAAT 
GGACTCAGATGGAGCCTGAAGAGTCCCCCTCAGGG7VACCTCACTAGAAAGAAGGAGGAAT 
CGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAACACTTTGGGAGGCTGAGGTGGGTGG 
ATCACAAGGTCAGGAGATCGAGACCATCCTGGCTAACACAGTGAAACCCCATCTCTACTA 
AAAATACAAAAAATGAGCCAGGCATGGTGGCGGGCGCCTGTAGTCCCAGCTACTCAGGAG 
[T,C,G3 

CTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGACGAGATCACGC 
CACTGCACTCCAGCCTGGGCAACAGAGCGAGATTCCGTCTCAAAAAAAAAAAGAAAGAAA 
GGAAGAAGGGGGAATGGGGGAGAGGGGCCGGTCCCTTTTTGAGTCTAGCCTTCTGCGCAG 
GGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCTGAGGAAGTTTACCATGCTTGCTC 
TGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGAGCTGATCCAGGCGGAGGCCCGGT 

5213 CTTTGGGAGGCTGAGGTGGGTGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCTAAC 
ACAGTGAAACCCCATCTCTACTAAAAATACAAAAAATGAGCCAGGCATGGTGGCGGGCGC 
CTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGA 
GGTTGCAGTGAGACGAGATCACGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGATTCC 
GTCTCAAAAAAAAAAAGAAAGAAAGGAAGAAGGGGGAATGGGGGAGAGGGGCCGGTCCCT 
[T,G,A,C] 

TTTGAGTCTAGCCTTCTGCGCAGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCT 
GAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGA 
GCTGATCCAGGCGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGGTCAGCA 
TGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCTCCTACTGTGGCTG 
GGGGTGGCCCCAACCCAGGTCCTGGAATGGGCAGGAGGGGAAGCCTTGAACTCTAGGGCT 

5508 TCCCTTTTTGAGTCTAGCCTTCTGCGCAGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAG 
GCAGCTGAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGG 
CGAGGAGCTGATCCAGGCGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGG 
TCAGCATGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCTCCTACTG 
TGGCTGGGGGTGGCCCCAACCCAGGTCCTGGAATGGGCAGGAGGGGAAGCCTTGAACTCT 
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[A,C,T,G] 

GGGCTGGCCTGGGGGTTCTGTTCACTGCCACCTTCTGTCTCTGTCCCACTGTCTCTCCGA 
GGCTGTCATGACATCTCTCTGTGTGTCTCTGGTGCTATCATCCCATTCTTCCTGGGTCTC 
CATCTCTCTCTCTGTCTCTTTTCTTTCTCTCTCCTTTCCTCTATTTTTTGGGCCCTCAGT 
CTATCTCTGTTTCTGTCTCCCTGTCTGTGTGATGGTCACTCTGTTTCTTTCTCCCTGTCT 
GTTTCTCTGTCCCTATCTGTCTGTATCCTTCTTTGCCTGTTTAGCTCTCTCCCTGCGCTG 

5857 CTGTCTCTCCGAGGCTGTCATGACATCTCTCTGTGTGTCTCTGGTGCTATCATCCCATTC 
TTCCTGGGTCTCCATCTCTCTCTCTGTCTCTTTTCTTTCTCTCTCCTTTCCTCTATTTTT 
TGGGCCCTCAGTCTATCTCTGTTTCTGTCTCCCTGTCTGTGTGATGGTCACTCTGTTTCT 
TTCTCCCTGTCTGTTTCTCTGTCCCTATCTGTCTGTATCCTTCTTTGCCTGTTTAGCTCT 
CTCCCTGCGCTGTCCATCCATCTTTCCCTGCCTCCCTGTCTCTCTCTGGTTGGGTTCAGC 
[C,G,AsT] 

CCAACCTGCTCCCCTCTGCCTGGCTCCATCACAGCCTACCTCCCTGCCCCCATTCCCCCC 
AGGACGCCCATTCGATCCCTCCCTGCTGCTGGCCCAGGCCACCTCCAACGTAGTCTGCTC 
CCTCCTCTTTGGCCTCCGCTTCTCCTATGAGGATAAGGAGTTCCAGGCCGTGGTCCGGGC 
AGCTGGTGGTACCCTGCTGGGAGTCAGCTCCCAGGGGGGTCAGGTGAGTGGGTGGGACCC 
CTCTCCAACTACCTTCCCTGAAGGTTCCTGCCAAGGTCCCATGAGAACTAGCTGCCCTTC 

6385 GTGGGTGGGACCCCTCTCCAACTACCTTCCCTGAAGGTTCCTGCCAAGGTCCCATGAGAA 
CTAGCTGCCCTTCTCCCCACAGACCTACGAGATGTTCTCCTGGTTCCTGCGGCCCCTGCC 
AGGCCCCCACAAGCAGCTCCTCCACCACGTCAGCACCTTGGCTGCCTTCACAGTCCGGCA 
GGTGCAGCAGCACCAGGGGAACCTGGATGCTTCGGGCCCCGCACGTGACCTTGTCGATGC 
CTTCCTGCTGAAGATGGCACAGGTGTGGGAAGGGTGCAGGGACCCCCTCTCTGAATGGGC 
[C,T,G] 

TGGTGACCTGGCAGGTCCCAGCCAGGTGTCCCTGGGGACCTCAATTGGGTTCCTCTCTCT 
TTCTCTCTCTGCATGTCTCTGTGAGTATGAGTGTCTCTGTGCATGTGTGTGCATCCCTTC 
TCTGCACATCTGTGCTGGCCCTTTCAGGGCGTTGCTCTCACTGCCTCTCCCGCCCCCGAC 
CTGGGCATTTGTGCCGGGCTGTCTGTCTCTCCAGCATCTCTCCTCTTTCTCCCTCCCACC 
TCGGCCCTTGTGTTCAGGCCCCATGCCCAGGGTCCTACACCAGCAATCCCCAGGATCACT 

6813 ATCTGTGCTGGCCCTTTCAGGGCGTTGCTCTCACTGCCTCTCCCGCCCCCGACCTGGGCA 
TTTGTGCCGGGCTGTCTGTCTCTCCAGCATCTCTCCTCTTTCTCCCTCCCACCTCGGCCC 
TTGTGTTCAGGCCCCATGCCCAGGGTCCTACACCAGCAATCCCCAGGATCACTTCATCCC 
ATCCCCTGCAGCCTCCCCAGACTTTTATGTAAATTCACAATTTTATGTGAATTATGGTCA 
TTTATTAGGAAGCCTTGCAATATCAAGTTATGTTAATAAAGTCCACTTTATTAATTATAT 
[C,T,A] 

AGAACAATATTTCTTTTCCTTTTTTTTTTCTTTTCTTTTTAAAGAGACAGGATCTCTTTC 
TGTTGCCCAGGCTAGAGTACAGTTGCAAAATCATAGCTCACTGCAACCTTGAACTCCTGG 
GCTCAAGCAATCCTCCTGCCTCGGGCTCCTGAGTAGCTGGGACAACAGGTGTGCACCACC 
ACACCTGGCTAAATTTTTTTTTTTTCTTTGTAGAGATAGACTCTCACTATGTTACCCAGG 
CTGGTCTTGAATTCCTGGGCTCATGTAATCCTCCTGCTGCCTTGAACTCCCAAAGTGCTG 

7853 TCTTTAATTAAATGCAGCCTCCCTGTTTGTGGAGAAAGAAAAATTTTTCTTAACCCTATT 
GCCCCATTTCTTTTCTCTTTTATTGAATATTTTTTAGTTTTAACTATAGTAAAATACACA 
TAACGTTTACCATCTTAACCATTTTTAGGTATACAGTACAGTAGTGTTCAGTACATTCAT 
ACTGTTATGCAATCAGTCTCCAGAACTCTTCATGTTGCAAAGCTGAAACTCTATACCCAT 
TAAACAACTGCCTGTTCCTCCCTCCTCCAACCCCTGGCAATCACCTTTTTTTTTTTGAGA 
[C,T] 

GAAGTCTCACTCTGTCACCCAGGCTAGAGTGCGGTGGCTCGATCTCGGCTCACTGCAAGC 
TCCGCCTCCCGGGTTCATGCCATTCTCCTGCCTCAGTCTCCCAAGCAGCTGGGACTACAG 
GTGCCCGTCACCACGCCTGGCTAATTTTTTGTATTTTTAGTAGAGATGGAGTTTCATCGT 
GTTAGCCAGGCTGATCTCAAACTCCTGGCCTCAAGTGATCCACCCGCCTCGGCCTCCCAA 
AGTGCTGGGACTACAGGCGTGAGCCACTGTGCCTGGCCAGGAAGTAGACTCTTGATATTA 

12973 CTGTCTTCCTCTCTTTCTCTCTTTCTTTCTTGACAGGGTCTCTCTTTGTCTCCCAGGCTG 
GGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAACTCCTAGGCTCAAGTGATCCT 
CCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 
TTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCCCAGGCTGGTCTCAAACTCCTG 
GGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
[G,C,T] 

GCACGCCACTCAACACTCCACAAATGTTGATGCCATTATGTTTTGTGAACTAGTGTCCCT 
GGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCC 
TGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCTTCTCCTTAGGTATCTGCTGCA 
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12973 CTGTCTTCCTCTCTTTCTCTCTTTCTTTCTTGACAGGGTCTCTCTTTGTCTCCCAGGCTG 
GGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAACTCCTAGGCTCAAGTGATCCT 
CCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 
TTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCCCAGGCTGGTCTCAAACTCCTG 
GGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
[A,G,T] 

GCACGCCACTCAACACTCCACAAATGTTGATGCCATTATGTTTTGTGAACTAGTGTCCCT 
GGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCC 
TGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCTTCTCCTTAGGTATCTGCTGCA 

13012 CTCTCTTTGTCTCCCAGGCTGGGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAA 
CTCCTAGGCTCAAGTGATCCTCCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCA 
TACCACCATGCCTGGCTAATTTTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
GGGATTACAGGCATGAGCCACTGCACGCCACTCAACACTCCACAAATGTTGATGCCATTA 
[G,A,CsT] 

GTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATA 
GACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCT 
TCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 

13072 CTCCTAGGCTCAAGTGATCCTCCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCA 
TACCACCATGCCTGGCTAATTTTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
GGGATTACAGGCATGAGCCACTGCACGCCACTCAACACTCCACAAATGTTGATGCCATTA 
TGTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTTGTACTCCACACTCGAGGACCAAAT 
[A,T,CsG] 

GACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCT 
TCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 
ATCTGTGCCAGCTGGGGGCACCCTTCTGCACCCTGGGCTTACTGTTGGCTCCTCCACCTG 

13370 ATAGACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGG 
TCTTCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGT 
TCAACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGC 
CCTTCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCA 
GGCATCTGTGCCAGCTGGGGGCACCCTTCTGCACCCTGGGCTTACTGTTGGCTCCTCCAC 
[C,G] 

TGCTGTTCCCCCCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCT 
CTCCTCACCAGGGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGAGCTCTTCCT 
CTTCTTCACCACCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCC 
TGAGCCTCAAGCCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCAAG 
TCCGTCCCACTGACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGT 

13682 CCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCTCTCCTCACCAG 
GGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGAGCTCTTCCTCTTCTTCACCA 
CCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCCTGAGCCTCAAG 
CCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCAAGTCCGTCCCACT 
GACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGTGGTGGGTGCCC 
[C,T,A] 

GGACGGTGCCTCCAGCCTCAACAGTGGGCATGGACAGGGTTAATGTCTCCAGAGTGTACA 
CTGCAGGCAGCCACATTTACACGCCTGCAGTTGTTTTCCGGAGTCTGTCCCACGGCCCAC 
ACGCTCACTTGACTCATGCTGCTAAGATGCACAACCGCACACCCATACACAACTACAAGG 
GCCACAAAGCAACTGCTGGGTTAGCTTTCCACAGACATAAATATAGTCCATCTGCAATCA 
CAAGCACATAGCCAGGTAACCCACCAACTCCCCTGGATCTGCAGCCCACACGTGGGAGTC 
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14 631 GCAGCCCTCAGTCACCCCTTTTTAAGCACCCTGATTCTACCAAATGCAAACACATCTGGG 
TCTGCGATTATGCACAGAGACTTTGGACATACGAGGACCCTCAGACCGGAGGAACACCTG 
CCCAACCCCAACACGTGCTTATGTAACCACGTGGAAAGCGGCCCCTGCTGCCCCTCCACA 
CACACATACACACTCACTGATCTACAGCCCCTGTTCGGCGTCAGAGTCCCCACTAGACCC 
AGTGGAAGGGGTTAGAGACCAAGTAGGGGCCAGTTTCCAATTCACCCTGTCAGGGAGTGA 
[C,T,G] 

CCGGATCTGACGTTCCTTGTGACTTAAGGGTCCGGCTTGGGAATTAAAGTTTGTTTCTGG 
CCTTTAGCCTACTGCGTGTGTGACCCGTGTCAGTCACTGTGAGTAAGGGGTGGGGACAGG 
GGAGTCCACCCCTCCCCTGAGGCTGGGCGGGAGCTGAAAAACATGGCCACCGCCCACCCT 
GGCTGTTGACATCAGGACCAGATGTGGAGCTGGGAGGAGGGGCAGGGCTGGTGACGCCCT 
GGGCCTCATTTCCAAAAAGGGCCAAGGTGTCCGGCGGTGGGAAGTGGGCAAGGAGGGGGT 

Chromosomal Map Position 
ePCR to dbSTS 

Site (bases) Marker Chr. Organism 

15155. .15288 stSG46708 19 Homo sapiens 
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